Regularities in football goal distributions 



L. C. Malacarne & R. S. Mendes 
Departamento de Física, Universidade Estadual de Maringá, Avenida Colombo 5790, 87020-900, Maringá-PR, Brazil






Despite the complexities of football championships, some
regularities can be identified in them. These regularities
concern the distributions of goals by goal-players and by
games. In particular, the distribution of goals by goal-players
is well fitted by the Zipf-Mandelbrot law, suggesting a
connection with an anomalous decay.



Regularity in some complex systems can sometimes be
identified and expressed in terms of simple laws. Typical
examples of such situations are found in a wide range of
contexts, such as the frequency of words in a long text, the
population distribution in big cities, forest fires, the
distribution of species lifetimes for North American breeding
bird populations, scientific citations, www surfing, ecology,
solar flares, economic indices, and epidemics in isolated
populations, among others. Here, universal behaviours in the
most popular sport, football, are discussed. More precisely,
this work focuses on regularities in the distribution of goals
by goal-players and by games in championships. Furthermore,
the goal distribution by goal-players is connected with an
anomalous decay related to the Zipf-Mandelbrot law and with
Tsallis nonextensive statistical mechanics.

In many contexts, a few high-intensity events occur
alongside many low-intensity ones. For instance, a long text
generally contains many words that are used only a few times
and a small number that occur very often. The systems
mentioned above are good examples too. In particular, this
kind of behaviour usually occurs in football championships,
because many players score few goals, in contrast with the
top scorers.

A detailed view of this behaviour can be obtained by
considering some of the most competitive and traditional
championships in the world. Our particular choice of
championships was guided by the criterion that the
corresponding data be easily accessible to anyone [19], [20].
We therefore consider some of the main league football
championships from Italy, England, Spain and Brazil. Each of
these championships involves about twenty teams, around three
hundred games, and approximately eight hundred goals [22].
Fig. 1 shows data from these championships. In these plots,
the abscissa is the number of goals $x$ divided by the average
number of goals $m$ (total number of goals divided by the total
number of goal-players), and the ordinate is the number $N(x)$
of players with $x$ goals divided by the number of players with
one goal, $N(1)$. The regular shape of the curves in Fig. 1
suggests a general law describing the distribution of goals.
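
The scaling just described can be sketched in a few lines of Python. This is only an illustration: the function name `scaled_player_distribution` and the toy list of goals are hypothetical and are not taken from the championship data used in the paper.

```python
from collections import Counter


def scaled_player_distribution(goals_per_player):
    """Return (m, points), where points = [(x / m, N(x) / N(1)), ...].

    goals_per_player: one integer per player (goals scored in a season).
    Only goal-players (players with at least one goal) are counted.
    """
    scorers = [g for g in goals_per_player if g > 0]
    m = sum(scorers) / len(scorers)   # average goals per goal-player
    counts = Counter(scorers)         # counts[x] = N(x)
    n1 = counts[1]                    # N(1), players with exactly one goal
    if n1 == 0:
        raise ValueError("no player with exactly one goal; cannot scale by N(1)")
    points = [(x / m, counts[x] / n1) for x in sorted(counts)]
    return m, points


# Hypothetical toy data, NOT real championship data.
toy_goals = [1, 1, 1, 1, 2, 2, 3, 5, 0]
m, points = scaled_player_distribution(toy_goals)
```

With real data, plotting `points` for several championships on the same axes is what would reveal whether the curves collapse onto a common shape.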

FIG. 1. Scaled distribution of goals in main league football
championships from Italy, England, Spain and Brazil. The
ordinate is the number $N(x)$ of players with $x$ goals divided
by the number of players with one goal, $N(1)$, and the
abscissa is the number of goals divided by the average number
of goals $m$. These scaled data indicate a regularity in the
goal distribution for goal-players. The European championships
start in one year and finish in the subsequent year, and the
Brazilian championships start and finish in the same year.

In the study of most of the systems cited above, Zipf's
law, $N(x) = a/x^{b}$, arises naturally, at least in part of
the analysis. In Zipf's law, $a$ and $b$ are constants and $x$
is the independent variable. To obtain a better fit to a large
part of the data, and based on information theory, Mandelbrot
proposed $N(x) = a/(c + x)^{b}$ as a generalization of Zipf's
law, with $a$, $b$, and $c$ all being constants. This
Zipf-Mandelbrot distribution also arises in the context of a
generalized statistical mechanics proposed some years ago,
where it is equivalently rewritten as

$$
N(x) = N_0 \left[ 1 - (1-q)\,\lambda x \right]^{1/(1-q)} \qquad (1)
$$

where $N_0$, $\lambda$, and $q$ are real parameters. In addition,
this function satisfies an anomalous decay equation.
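
One way to make this statement concrete is to differentiate the distribution directly. The derivation below is a sketch that takes equation (1) in the form $N(x) = N_0 [1 - (1-q)\lambda x]^{1/(1-q)}$; it is not necessarily the form in which the authors write the decay equation.

```latex
\begin{align*}
\frac{dN}{dx}
  &= N_0 \,\frac{1}{1-q}\,\bigl[-(1-q)\lambda\bigr]\,
     \bigl[1-(1-q)\lambda x\bigr]^{\frac{1}{1-q}-1} \\
  &= -\lambda N_0 \bigl[1-(1-q)\lambda x\bigr]^{\frac{q}{1-q}}
   = -\lambda N_0 \left(\frac{N}{N_0}\right)^{q},
\end{align*}
\qquad\text{so that}\qquad
\frac{d}{dx}\!\left(\frac{N}{N_0}\right) = -\lambda \left(\frac{N}{N_0}\right)^{q}.
```

For $q = 1$ the right-hand side is linear in $N$ and the decay is the ordinary exponential one; for $q \neq 1$ the decay rate depends on a power of $N$.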

The parameter $q$ can be considered a measure of how
anomalous the decay is. In particular, equation (1) reduces
to the usual exponential decay, $N(x) = N_0 \exp(-\lambda x)$,
in the limit $q \to 1$.
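
The limit and the link with the Zipf-Mandelbrot form can be checked numerically. The sketch below assumes equation (1) has the form $N(x) = N_0 [1 - (1-q)\lambda x]^{1/(1-q)}$; the function names are hypothetical, and the cutoff to zero for a non-positive bracket is a convention assumed here, not something stated in the text.

```python
import math


def q_decay(x, n0, lam, q):
    """Equation (1): n0 * [1 - (1 - q) * lam * x] ** (1 / (1 - q)).

    Falls back to the exponential n0 * exp(-lam * x) when q == 1.
    """
    if abs(q - 1.0) < 1e-12:
        return n0 * math.exp(-lam * x)
    base = 1.0 - (1.0 - q) * lam * x
    if base <= 0.0:
        return 0.0  # assumed cutoff; only reachable for q < 1
    return n0 * base ** (1.0 / (1.0 - q))


def zipf_mandelbrot_params(n0, lam, q):
    """For q > 1, return (a, b, c) such that q_decay(x) == a / (c + x) ** b."""
    b = 1.0 / (q - 1.0)
    c = 1.0 / ((q - 1.0) * lam)
    a = n0 * c ** b
    return a, b, c


# q close to 1 approaches the ordinary exponential decay.
near_exponential = q_decay(2.0, 1.0, 0.5, 1.0 + 1e-6)

# q = 1.33 corresponds to a Zipf-Mandelbrot exponent b close to 3.
a, b, c = zipf_mandelbrot_params(100.0, 0.7, 1.33)
```

The mapping $b = 1/(q-1)$, $c = 1/[(q-1)\lambda]$, $a = N_0 c^{b}$ follows from factoring $(q-1)\lambda$ out of the bracket in equation (1).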

Motivated by these physical connections, we employ the
distribution (1) to fit the goal data. Following the
construction of Fig. 1, we use the number of goal-players with
one goal, $N(1)$, and the average number of goals per
goal-player, $m$, to eliminate $N_0$ and $\lambda$. Furthermore,
it is a good approximation to replace the discrete average
with a continuous one in the present analysis, i.e.,

$$
m = \frac{\int_0^{\infty} x\, N(x)\, dx}{\int_0^{\infty} N(x)\, dx}
  = \frac{1}{\lambda\,(3-2q)} \qquad (q < 3/2). \qquad (3)
$$
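
The closed form for the continuous average can be checked numerically. The sketch below assumes the distribution $N(x) = N_0 [1 - (1-q)\lambda x]^{1/(1-q)}$ with $1 < q < 3/2$ and integrates from $0$ to infinity; the helper names and the choice of a midpoint rule with the substitution $x = t/(1-t)$ are illustrative assumptions, not part of the original analysis.

```python
def closed_form_mean(lam, q):
    """Continuous average m = 1 / (lam * (3 - 2 q)), valid for q < 3/2."""
    if q >= 1.5:
        raise ValueError("the mean diverges for q >= 3/2")
    return 1.0 / (lam * (3.0 - 2.0 * q))


def numeric_mean(lam, q, steps=100_000):
    """Midpoint-rule estimate of int x N dx / int N dx over [0, inf).

    Uses x = t / (1 - t) to map [0, inf) onto [0, 1); N_0 cancels in the ratio.
    Intended for 1 < q < 3/2, where the bracket below stays positive.
    """
    exponent = 1.0 / (1.0 - q)
    h = 1.0 / steps
    numerator = 0.0
    denominator = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        x = t / (1.0 - t)
        jacobian = 1.0 / (1.0 - t) ** 2                 # dx/dt
        n = (1.0 - (1.0 - q) * lam * x) ** exponent     # N(x) / N_0
        numerator += x * n * jacobian
        denominator += n * jacobian
    return numerator / denominator


exact = closed_form_mean(0.5, 1.33)
estimate = numeric_mean(0.5, 1.33)
```

The two values should agree closely, which also illustrates why the condition $q < 3/2$ is needed: as $q$ approaches $3/2$ the closed form grows without bound.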



Thus, the distribution of goals dictated by equation (1)
can be rewritten as

$$
N(x) = N(1) \left[ \frac{m\,(3-2q) + (q-1)\,x}{m\,(3-2q) + (q-1)} \right]^{1/(1-q)} \qquad (4)
$$

where $q$ is the only parameter that remains to be adjusted,
since $N(1)$ and $m$ are obtained directly from the data.
Fig. 2 illustrates the application of equation (4) to four
championships, indicating the goodness of the formula. The
same conclusion is obtained for the other championships shown
in Fig. 1. Here, $q = 1.33$ was employed as an approximate
value, leading to the Zipf-Mandelbrot exponent $b \approx 3$.
In this way, $q \approx 1.33$ can be interpreted as a universal
parameter for this kind of championship. It is also
interesting to remark that $b \approx 3$ occurs in the
distribution of scientific citations.

Other kinds of regularities in goal distributions can be
identified, but with different behaviours. This can be
verified in the distribution of goals per game. Proceeding as
in Fig. 1, we consider a normalized, scaled distribution of
games and goals. In this case, the abscissa is the number $x$
of goals divided by $M$, the mean number of goals per game in
a championship (the number of goals in a championship divided
by the corresponding number of games). The ordinate is the
number of games with $x$ goals in a championship divided by the
total number of games in that championship. Fig. 3 contains
plots of this kind, illustrating this regular behaviour by
considering, again, the main league football championships
from Italy, England, Spain and Brazil. As one can see, this
figure strongly suggests a regularity in the distribution of
goals for distinct championships.

FIG. 2. Fit of the goal distribution for goal-players in
Italian (a), English (b), Spanish (c), and Brazilian (d)
championships. The black circles are the goal data and the
solid line is the fit with the $N(x)$ distribution given in
equation (4) with $q = 1.33$.
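
A one-parameter fit of this kind can be sketched as follows. Substituting $\lambda = 1/[m(3-2q)]$ from equation (3) into equation (1) and dividing by $N(1)$ gives $N(x) = N(1)\,\{[m(3-2q) + (q-1)x]/[m(3-2q) + (q-1)]\}^{1/(1-q)}$, which is what `goal_distribution` evaluates. The grid search and the least-squares cost in log space are assumptions made for illustration and are not the authors' fitting procedure; the data are synthetic.

```python
import math


def goal_distribution(x, n1, m, q):
    """N(x) expressed through N(1), m and q; assumes 1 < q < 3/2."""
    top = m * (3.0 - 2.0 * q) + (q - 1.0) * x
    bottom = m * (3.0 - 2.0 * q) + (q - 1.0)
    return n1 * (top / bottom) ** (1.0 / (1.0 - q))


def fit_q(xs, ns, n1, m, grid=None):
    """Hypothetical helper: pick q on a grid minimising squared error in log N(x)."""
    if grid is None:
        grid = [1.01 + 0.01 * k for k in range(48)]  # 1.01, 1.02, ..., 1.48

    def cost(q):
        return sum(
            (math.log(goal_distribution(x, n1, m, q)) - math.log(n)) ** 2
            for x, n in zip(xs, ns)
        )

    return min(grid, key=cost)


# Synthetic data generated from the model itself (NOT championship data).
xs = list(range(1, 16))
true_q, n1, m = 1.33, 100.0, 2.5
ns = [goal_distribution(x, n1, m, true_q) for x in xs]
q_hat = fit_q(xs, ns, n1, m)
```

Because the synthetic data come from the model, the search simply recovers the generating value of $q$; with real counts, zero entries would have to be excluded before taking logarithms.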

FIG. 3. Semi-log plot of scaled distributions of goals per
game for championships from Italy, England, Spain and Brazil.
For each championship, $x$ is the number of goals per game, $M$
is the average number of goals per game, $G(x)$ is the number
of games with $x$ goals, and $G_T$ is the total number of
games.
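
The per-game scaling used for Fig. 3 can be sketched in the same spirit. The function name and the toy list of game totals are hypothetical and serve only to show how the pairs $(x/M,\ G(x)/G_T)$ would be computed.

```python
from collections import Counter


def scaled_game_distribution(goals_per_game):
    """Return (M, points), where points = [(x / M, G(x) / G_T), ...].

    goals_per_game: one integer per game (total goals scored in that game).
    """
    total_games = len(goals_per_game)                  # G_T
    mean_goals = sum(goals_per_game) / total_games     # M
    counts = Counter(goals_per_game)                   # counts[x] = G(x)
    points = [(x / mean_goals, counts[x] / total_games) for x in sorted(counts)]
    return mean_goals, points


# Hypothetical toy data, NOT real championship data.
toy_games = [0, 1, 1, 2, 2, 2, 3, 3, 4, 2]
mean_goals, game_points = scaled_game_distribution(toy_games)
```

On a semi-log plot, the second coordinate of each point would be drawn on a logarithmic axis.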

Despite the countless factors at play, including
fluctuations due to the relatively small number of teams and
games in a football championship, regularities arise in the
goal distributions. In particular, the goal distribution for
players who score goals is well fitted by a Zipf-Mandelbrot
law, suggesting a connection with ubiquitous phenomena such as
anomalous diffusion.

Since the complete set of data for all championships of
these countries is not easily available, we have used data
that are both recent and easily accessible.
[22] In the European championships, the leagues are organized
in such a way that each team plays two games against each of
the other teams. In the Brazilian championships, each team
plays only once against each of the others. After this, the
eight best teams go to the playoffs. Nevertheless, the number
of playoff games in the Brazilian championships is much
smaller than the number of games in the classification phase,
and this does not affect the analysis when compared with the
European ones.